Method for optimally utiilizing a peer to peer network

ABSTRACT

In accordance with an embodiment of the invention, there is provided a method for splitting a load of monitoring a peer to peer network. The method has a first node in the peer to peer network receiving information reported by a second node in the peer to peer network, including information concerning which nodes are connected to the second node. The method further has the first node connecting to nodes different from the nodes that are connected to the second node using the received information. The method still further has the first node reporting information, including information concerning which nodes are connected to the first node, such that the second node also has access to the reported information, thereby splitting the load of monitoring the peer to peer network among the first and second nodes.

RELATED APPLICATIONS

This application is a divisional of U.S. application Ser. No. 11/039,981, filed Jan. 21, 2005, which is a continuation-in-part of U.S. application Ser. No. 10/764,111, filed Jan. 23, 2004.

The entire teachings of the above applications are incorporated herein by reference.

FIELD OF THE INVENTION

The present invention provides a method for optimally utilizing peer to peer networks, and, in particular, to optimally utilize peer to peer networks to increase the amount of communications messages received.

BACKGROUND OF THE INVENTION

As used herein, peer to peer networks which are the subject of the present invention comprise multiple nodes, each node typically consisting both of file server and client which can send and receive communication messages or information to or from a node to which such is connected.

In a peer to peer network each node is connected to other nodes over a communication medium such as the internet either directly or through some type of proxy. For example, when a search request is issued such originating node sends a search request to all of the nodes to which it is connected (see FIG. 1) These nodes search their list of available files and if a match is found they send a response back with the location. However, a peer to peer proxy network typically consists of node A which is connected to a node B and node B is connected to a node C. (see FIG. 2) Node A is not connected to node C such that if node A issues a search request it will be forwarded to node B and Node B will search its available files and if a match is found it will send a response back to node A. Node B will then forward node A's request to node C and Node C will search its available files and if a match is found it will send a response back to node B. Node B will then forward this response to Node A. FIG. 3 discloses a nonproxy loop network wherein each node is directly connected to another.

Some peer to peer networks utilize a leaf node/main node proxy topology (See FIG. 4) where some nodes are classified as main nodes and the remaining nodes are classified as leaf nodes. Leaf nodes can only connect to main nodes. Only main nodes can connect to other main nodes. When a leaf node issues a search request it sends the request to the main node that it is connected to. The main node then forwards the request to any other leaf nodes that are connected to it and also to any main nodes it is connected to. These main nodes forward the request to any leaf nodes that are connected to them.

Accordingly it is an object of the present invention to provide a method for optimally utilizing a peer to peer network. It is yet another object of the invention to provide a method for reducing the number of connections required from a single node on a peer to peer network to view most, if not all, communication messages. It is yet another object of the invention to provide a method for optimally connecting to the network. It is yet another object of the invention to provide a method for locating nodes specific distances away from a first node.

SUMMARY OF THE INVENTION

Generally, the present invention provides a method for optimally utilizing a peer to peer network by controlling how a node connects into the network and by controlling how a node locates optimal nodes as well as by how the node interacts with the network and other nodes.

In one embodiment a first node issues a search for preselected information to help locate other nodes by watching responses. In another embodiment a first node issues a ping and compares the hops value to a predefined optimal value. In yet another embodiment a first node maintains connection statistics and compares these to a configured optimal value. In yet another embodiment a first node continuously clears its host cache at predetermined times.

In all of the embodiments, a node is configured to have one or more of the features set forth below. These features are employed in the invention to optimally utilize a peer to peer network as compared to the other network nodes on the particular network being addressed not so optimized. Thus, not all of the capabilities need to be programmed into each node in order to optimally utilize the network. The presently preferred methods of the present invention include:

configuring a node to send pings and review the distance parameters contained in the results.

configuring a node to send preconfigured searches and review the distance parameters contained in the results.

configuring a node to clear or modify its host cache based on a comparison of its host cache size.

configuring a node to clear or modify its host cache based on comparison of how long its host cache has existed.

configuring a node to throttle its connection attempts.

configuring a node to drop connections based on calculations of duplicate communication messages received.

configuring a node to drop connections based on the last time a transmission was received from a connection.

configuring a node to drop connections based on how well the connection is performing when compared to other connections.

configuring a node to connect to other similar nodes or a master node and share processing of the network.

Other advantages of the present invention will become apparent from a perusal of the following detailed description of presently preferred embodiments of the invention taken in connection with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing will be apparent from the following more particular description of example embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating embodiments of the present invention.

FIG. 1 is a simplified schematic of a two node peer to peer network;

FIG. 2 is a simplified schematic of a peer to peer proxy network;

FIG. 3 is a simplified schematic view of a peer to peer, nonproxy, loop network;

FIG. 4 is a simplified schematic of a peer to peer leaf/main node network;

FIG. 5 is a simplified schematic of a peer to peer network consisting of more than 5 hops;

FIG. 6 is a simplified schematic of a peer to peer network with nodes sharing the load;

FIG. 7 is a simplified schematic of a peer to peer network with nodes sharing the load but not yet connected;

FIG. 8 is a simplified schematic of a peer to network with a node throttling its connections;

DETAILED DESCRIPTION OF THE INVENTION

A description of example embodiments of the invention follows.

Generally, peer to peer networks are quite large, often a million or more nodes. To reduce the bandwidth required to operate such networks, nodes have a community imposed transmission distance limitation. Most communication messages contain communication radius parameters such as hops. Hops is a value that normally starts at 0 and increments each time the communications is forwarded. When hops reaches a preset limit, often 5, the communications is dropped from the network. This effectively enforces a community “time to live” value and limits the number of nodes that would receive the communications from a particular transmitting node. It therefore would be optimal and advantageous to connect in such a way that a node would be within reach of all communication messages.

In one embodiment of the invention a first node wishing to be optimally connected to a second node issues a search request containing a preconfigured search term. This search term can be any term but preferably one that will match many files on other nodes. As other nodes available through the second node respond to the first node, the first node looks at the hops value of their responses and compares it to a value which value can be preconfigured by the operator. Such value can be generated by a mathematical calculation based on other values, or it can be in relation to other values. If the hop value is equal or greater than the compared value, the first node will attempt to connect to the node sending the response. If the hop value is less than the compared value the first node will not attempt to connect to the node sending the response. This method allows the first node to connect to node that are N hops away from currently connected nodes and expands its communications radius.

In another embodiment, the first node connects to a second node and issues a ping rather than a second request. As other nodes available through the second node respond to the first node, the first node looks at the hops value of their responses and compares it to a value which can be preconfigured by the operator. Such value can be generated by a mathematical calculation based on other values, or it can be in relation to other values as in the first embodiment. If the hop value is more or greater than the compared value, the first node will attempt to connect to the node sending the response. If the hop value is less than the compared value the first node will not attempt to connect to the node sending the response. This method allows the first node to connect to node that are N hops away from currently connected nodes and expands its communications radius.

In another embodiment a first node seeking to locate other nodes on the network for connection purposes issues a search request containing a preconfigured search term. This search term can be any term but preferably one that will match many files on other nodes. As other nodes available through the second node respond to the first node, the first node attempts to connect to them or adds them to a cache to be connected to later.

Referring to FIG. 3, it is possible for a first node to be connected to other nodes which are within non-optimal distances from each other and these other nodes themselves having a second path to the first node. Other non-optimal connections are possible but the result of that is the first node would receive duplicate communications messages. It would benefit the first node if it could detect this situation, thus, in another embodiment of the invention, the first node maintains a count of duplicate communication messages which are received from each node. At intervals the first node will use the amount of duplicate communication messages in a preconfigured equation such as a comparison to a value, which value can be preconfigured by the operator, a value generated by a mathematical calculation based on other values or it can be in relation to other values. The comparison can be any comparison, for instance greater or less than or an average of. If using the equation, the node detects the connection is not optimal or meeting a certain criteria the first node will disconnect that connection.

It is possible for a first node to connect to a second node which is not connected to any other nodes or the second node may be configured to not forward any communications. In this situation the second node would be deemed unproductive. Thus, in another embodiment of the invention, the first node maintains a count of received communications messages for its connections. At intervals the first node will use the amount of received communication messages in a preconfigured equation such as a comparison to a value. This value can be preconfigured by the operator, it can be a value generated by a mathematical calculation based on other values or it can be in relation to other values. The comparison can be any comparison, for instance greater or less than or an average of. If using the equation, the node detects the connection is not optimal or meeting a certain criteria the first node will disconnect that connection.

In another embodiment the first node maintains a count of searches it has received from each connection. At intervals the first node will use these counts in a preconfigured equation such as a comparison to a value. This value can be preconfigured by the operator, it can be a value generated by a mathematical calculation based on other values or it can be in relation to other values. The comparison can be any comparison, for instance greater or less than or an average of. If using the equation, the node detects the connection is not optimal or meeting a certain criteria the first node will disconnect that connection.

In another embodiment the first node maintains the last time the node received a communication message on a specific connection. At intervals the first node will use the last transmission time in a preconfigured equation such as a comparison to a value, which value can be preconfigured by the operator, generated by a mathematical calculation based on other values or it can be in relation to other values. The comparison can be any comparison, for example greater or less than or an average of. If using the equation, the node detects the connection is not optimal or meeting a certain criteria the first node will disconnect that connection.

In some situations it may be preferable to drop connections that are not performing as well as the average of other connections or connections that are not performing within a certain percentage of the average of other connections or against a predefined performance range. Thus, in such a case, in another embodiment, the first node would keep specific communications statistics on its connections and at intervals calculate the average of these statistics and drop those connections that are below average or drop those connections that are below some percentage of the average.

Sometimes a node can get overloaded processing communications on peer to peer networks. In this situation it would be advantageous to be able to split the load of processing communications. Accordingly, in another embodiment of the invention, multiple nodes can connect to the network at different points and share the load. These multiple nodes would maintain communications paths between themselves or to a master node and transmit and receive information about what other network nodes and where each node is connected. This would allow multiple nodes to share the load. These nodes may also report back to a master node with the searches they are processing.

When connecting many times to a network, a load is placed on the resources of the node in relation to the number of connection attempts are occurring at one time. It would be a benefit to the node if it had some way to control or throttle multiple connection attempts to the network. In this embodiment of the invention, the node is configured for a set number of concurrent connection attempts. As connections are accepted, the node will add new connection attempts to maintain this set value. Without this method, a node wishing to connect to 1,000 other nodes would attempt 1,000 concurrent connections. With the method, and configured for a maximum number of 50 concurrent connections, the node would attempt 50 concurrent connection attempts to the network. As these connection attempts succeeded or failed the node would add enough new connection attempts to reach the set limit of 50. Once the limit of 1,000 connections are established the node would not attempt any further connections.

Although limiting and controlling the concurrent number of connection attempts by the node reduces load, it is sometimes desired to initially start with a large number of concurrent connection attempts and then limit the number to a set value. In another embodiment the node is configured to attempt only a set number of concurrent connection attempts. When the connection attempts first start, the node attempts as many connections as possible until the number of successful connections reaches some value. This value can be preconfigured by the operator, generated by a mathematical calculation based on other values, or it can be in relation to other values. Once this value is reached the node will reduce its attempts to the limited concurrent connection method described above.

In many cases, a first node may connect to a second node and after some time the second node may stop transmitting without the first node knowing of such occurrence. This second node may stop transmitting because of technical problems or it may stop transmitting because it is no longer being utilized. It would be a benefit to the first node to drop the connection just as a precaution after some time has past. In one embodiment the first node is configured to keep track of when it connected to a second node. After some configured or calculated time limit is reached, the first node drops the connection and attempts to connect to either the same node again or to a different node.

As nodes connect to the network they are constantly receiving address information about other nodes to which it can be potentially connected. As these new nodes are discovered they are added to a cache. This cache is used to provide the node with potential new connections. Some nodes have a set limit on the number concurrent connections they can have. Should their set limit be reached they will not connect to any further nodes but they will continue to add any newly discovered nodes to their cache. Should a node maintain very long connections, nodes in this cache may become invalid for various reasons. When the node finally looses connections and attempts to connect to nodes in the cache, resources are consumed and wasted because the nodes are invalid. Thus, in yet another embodiment, the node is configured to add nodes to its cache as normal but also configured to clear this cache at set intervals or when the cache reaches a certain limit. By constantly clearing the cache a reduction in invalid nodes is achieved.

EXAMPLES

The following Examples illustrate various embodiments of the methods according to the present Invention.

Example 1

Referring to FIG. 5, this example illustrates a method for obtaining hop information from search requests and using this information to optimally connect to the network.

In this example node A is connected to node B and wishes to optimally connect into the rest of the network. The network is configured to allow communication messages to travel a maximum of 5 hops so node A is configured to look for nodes 5 hops away. Each node contains a file called “Samuel.txt.” Node A sends out a search message to the network via node B with the term “Samuel.txt.” Nodes A, B, C, D, E and F all respond. Node A reviews each search response and finds that node F is 5 hops away. Node A connects to node F. Node A sends out a search message to the network via node F with the term “Samuel.txt.” Nodes B, C, D, E, F, G, H, I and R respond. Node A reviews each search response and finds that nodes B and R are 5 hops away. Node A knows that it is already connected to node B so it connects only to node R. Node A sends out a search message to the network via node R with the term “Samuel.txt.” Nodes F, G, H, I, R, Q, P, 0 and N respond. Node A reviews each search response and finds that nodes F and N are 5 hops away. Node A knows that it is already connected to node F so it connects only to node N. Node A sends out a search message to the network via node N with the term “Samuel.txt.” Nodes J, K, L, M, N, O, P, Q, and R respond. Node A reviews each search response and finds that nodes J and R are 5 hops away. Node A knows that it is already connected to node R so it connects only to node J. Node A sends out a search message to the network via node J with the term “Samuel.txt.” Nodes J, K, L, M, and N respond. Node A reviews each search response and finds that node N is 5 hops away. Node A knows that it is already connected to node N so it does not connect. Node A is now within 5 hops of all nodes and will receive all communications from all nodes.

Example 2

Referring again to FIG. 5, example 2 illustrates a method for obtaining hop information from pings and using this information to optimally connect to the network

In this example, node A is connected into node B and wishes to optimally connect into the rest of the network. The network is configured to allow communication messages to travel a maximum of 5 hops so node A is configured to look for nodes 5 hops away. Node A sends out a ping message to the network via node B. Nodes A, B, C, D, E and F all respond. Node A reviews each response and finds that node F is 5 hops away. Node A connects to node F. Node A sends out a ping to the network via node F. Nodes B, C, D, E, F, G, H, I and R respond. Node A reviews each response and finds that nodes B and R are 5 hops away. Node A knows that it is already connected to node B so it connects only to node R. Node A sends out a ping message to the network via node R. Nodes F, G, H, I, R, Q, P, 0 and N respond. Node A reviews each response and finds that nodes F and N are 5 hops away. Node A knows that it is already connected to node F so it connects only to node N. Node A sends out a ping message to the network via node N. Nodes J, K, L, M, N, 0, P, Q, and R respond. Node A reviews each response and finds that nodes J and R are 5 hops away. Node A knows that it is already connected to node R so it connects only to node J. Node A sends out a ping message to the network via node J. Nodes J, K, L, M, and N respond. Node A reviews each response and finds that node N is 5 hops away. Node A knows that it is already connected to node N so it does not connect. Node A is now within 5 hops of all nodes and will receive all communications from all nodes.

Example 3

Referring to FIG. 5, example 3 illustrates a method for locating other nodes so that more connection options exist.

In this example Node A wishes to find other nodes to connect to. Node A is already connected to node B. The network is configured to allow communication messages to travel a maximum of 5 hops. Each node contains a file called “Samuel.txt.” Node A sends out a search message to the network via node B with the term “Samuel.txt.” Nodes A, B, C, D, E and F all respond. Node A reviews each search response and uses the address information contain in the message to connect to these nodes.

Example 4

Referring to FIGS. 2 and 3, example 4 illustrates a method for optimizing a node's connections by looking at the number of duplicate messages that exist.

Referring to FIG. 3, B seeks to locate the file “Samuel.txt” and sends a search request out both of its connections to nodes A and C. Node C receives the search request. Node A receives the search request. Node A forwards the search request to node C. Node C records that it has received a duplicate message from node A. Node C finds that it has been configured to drop connections when it receives 1 duplicate message so it drops the connection to node A. Node C can still see searches from node A because they will travel through node B. Now referring to FIG. 2, the end result is that only one connection is needed to receive all communications from the network.

Example 5

Referring to FIG. 4, example 5 illustrates a method for optimizing a node's connections by monitoring the number of communication messages received on a connection.

In this example, main node 4 wishes to optimize its connections by monitoring how many communication messages it is receiving from all connections and comparing them to an average. If a connection does not meet the average it will disconnect the connection. Main node 4 records the following statistics:

Main node 2 has sent 1 communication message

Main node 3 has sent 1 communication message

Leaf node G has sent 1 communication message

Leaf node H has sent 1 communication message

Main node 4 then waits, for example, 5 minutes, and records the following statistics:

Main node 2 has sent 51 communication messages

Main node 3 has sent 53 communication messages

Leaf node G has sent 54 communication messages

Leaf node H has sent 1 communications message

Main node 4 adds the delta of all messages together and divides by 4 to get an average of 38.75. Because main node 4 is configured to drop any connections below the average, it will drop the connection to leaf node H.

Example 6

Referring again to FIG. 4, example 6, illustrates a method for optimizing a node's connections by monitoring the time of the last transmission received on a connection.

In this example, main node 4 is programmed to optimize its connections by monitoring when the last time its connections received a communication message and comparing them to a value. If a connection has not received any communication messages within 1 minute the node will drop the connection. Main node 4 records the following statistics:

Main node 2 has sent 1 communication message

Main node 3 has sent 1 communication message

Leaf node G has sent 1 communication message

Leaf node H has sent 1 communication message

Main node 4 then waits 1 minute and records the following statistics:

Main node 2 has sent 51 communication messages

Main node 3 has sent 53 communication messages

Leaf node G has sent 54 communication messages

Leaf node H has sent 1 communications message

Because main node 4 is configured to drop any connections that have not received any communication messages within 1 minute it will drop the connection to leaf node H.

Example 7

Referring to FIG. 4 again, example 7 illustrates a method for optimizing a node's connections by monitoring the number of search requests received on its connections.

In this example, main node 4 is programmed to optimize its connections by monitoring how many search requests it is receiving from all connections and comparing them to an average. If a connection does not meet the average it will disconnect the connection. Main node 4 records the following statistics:

Main node 2 has sent 1 search request

Main node 3 has sent 1 search request

Leaf node G has sent 1 search request

Leaf node H has sent 1 search request

Main node 4 then waits, 5 minutes and records the following statistics:

Main node 2 has sent 51 search request

Main node 3 has sent 53 search request

Leaf node G has sent 54 search request

Leaf node H has sent 1 search request

Main node 4 adds the delta of all messages together and divides by 4 to get an average of 38.75. Because main node 4 is configured to drop any connections below the average, it will drop the connection to leaf node H.

Example 8

Referring to FIG. 6, example 8 illustrates a method for splitting the load among multiple nodes and reporting the information to a master node. The master node also keeps track of which network nodes the load sharing nodes are connected to.

Here, Nodes 1, 7 and 13 are depicted as sharing the load of monitoring a network at optimal points. Node 1 is connected to node 2 and reports this information to master node A. Node 7 is connected to node 8 and reports this information to master node A. Node 13 is connected to node 14 and reports this information to master node A. Node 7 wishes to connect to node 2 and sends this request to master node A. Master node A knows that node 1 is connected to node 2 and denies the request.

Node 2 issues a search request for “samuel.txt.” Node 1 receives this communications message and forwards it to master node A. Master node A records the information. Node 17 issues a search request for “bob.txt.” Node 13 receives this communication message and forwards it to master node A. Master node A records this information.

Example 9

Referring to FIGS. 6 and 7, example 9 illustrates a method for splitting the load among multiple nodes and reporting the information to a master node. The master node also informs the load sharings nodes which network nodes to connect to.

Referring first to FIG. 7, in this example Nodes 1, 7 and 13 wish to join the network. Nodes 1, 7 and 13 send communication messages to master node A requesting clients to connect to. Master node A replies to node 1 with connection information for node 2. Master node A replies to node 7 with connection information for node 8. Master node A replies to node 13 with connection information for node 14.

Nodes 1, 7 and 13 connect and are sharing the load of monitoring a network at optimal points. Node 1 is connected to node 2 and reports this information to master node A. Node 7 is connected to node 8 and reports this information to master node a. Node 13 is connected to node 14 and reports this information to master node A. Node 7 wishes to connect to node 2 and sends this request to master node A. Master node A knows that node 1 is connected to node 2 and denies the request.

Node 2 issues a search request for “samuel.txt.” Node 1 receives this communications message and forwards it to master node A. Master node A records the information. Node 17 issues a search request for “bob.txt.” Node 13 receives this communication message and forwards it to master node A. Master node A records this information.

Example 10

Referring to FIG. 8, example 10 illustrates a method for throttling connection attempts to a network.

In this example node C wishes to connect to a maximum of four other nodes. In its cache it has the following entries:

Node A

Node H

Node L

Node V

Node B

Node O

Node E

Node D

Node C is configured to only have a maximum of 2 concurrent connection attempts and to wait 10 seconds for each connection attempt. Node C attempts to connect to node A and node H. Node C connects to node A and establishes a connection. Node C continues to wait for the connection attempt to node H. Because Node C connected to node A, there is now one empty connection slot so node C attempts to connect to Node V. The connection attempt to node H fails so there is now one empty connection slot. Node C attempts to connect to node B and this connection attempt succeeds. Because once again there is one empty connection slot node C attempts to connect to node O. An error occurs immediately and at the same time the connection attempt to node V fails as well. There are now two empty connection slots available. Node C attempts to connect to node E and node D. Node C's connection attempt with node D is successful. After 10 seconds, the connection attempt to node E fails.

While presently preferred embodiments have been described and depicted, the invention may be otherwise embodied within the scope of the following claims:

While this invention has been particularly shown and described with references to example embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims. 

What is claimed is:
 1. A method for splitting a load of monitoring a peer to peer network, the method comprising: at a first node in the peer to peer network, (a) receiving information reported by a second node in the peer to peer network, including information concerning which nodes are connected to the second node; (b) connecting to nodes different from the nodes that are connected to the second node using the received information; and (c) reporting information, including information concerning which nodes are connected to the first node, such that the second node also has access to the reported information, thereby splitting the load of monitoring the peer to peer network among the first and second nodes.
 2. The method of claim 1 wherein the information reported by the second node includes search requests received by the second node.
 3. The method of claim 1 wherein receiving the information reported by the second node includes receiving the information from a master node; and wherein the second node reports the information directly to the master node.
 4. The method of claim 1 wherein connecting to different nodes includes requesting a master node to return connection information for a node to connect to, the master node returning the connection information to the first node using the information reported by the second node.
 5. The method of claim 1 wherein reporting the information includes reporting the information to a master node; and wherein the second node receives the information reported by the first node from the master node.
 6. The method of claim 1 further comprising storing, in a cache, the nodes to connect to, which are different from the nodes that are connected to the second node.
 7. The method of claim 6 further comprising clearing the cache.
 8. A method for utilizing a peer to peer network having a plurality of nodes, the method comprising: a. defining a maximum concurrent connection value; b. attempting through a first node, multiple connections to nodes in the peer to peer network up to the maximum concurrent connection value; and c. adding new connection attempts up to the concurrent connection value as connection attempts succeed or fail.
 9. A method as set forth in claim 8 wherein the first node attempts more than the maximum concurrent connection value until a defined number of successful connection attempts.
 10. A method for utilizing a peer to peer network having at least a first node and a second node, the method comprising: a. connecting the first node to the second node; b. recording in the first node the time of the connection; c. comparing at an interval, the first node connection time to a value; and d. disconnecting said first node from said second node when the value is reached.
 11. A method for utilizing a peer to peer network having a plurality of nodes, said method comprising: a. at a node in the network, obtaining address information on the plurality of nodes; b. adding the address information to a cache; and c. clearing the cache when a specific event occurs.
 12. A method as set forth in claim 11 wherein the event is that the cache has reached a specific size.
 13. A method as set forth in claim 11 wherein the event is that the connection information in the cache has reached a specific age. 